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ABSTRACT 

The availability of a useful tool for simple and timely detection of the most important virulent varieties of 
Escherichia coli is indispensable. To this end, bacterial DNA pools which had previously been categorized 
were obtained from isolated colonies as well as selected in terms of utilized phenotype; the pools were 
assessed by two PCR Multiplex for the detection of virulent E. coli eaeA, bfpA, stxl, stx2, ipaH, ST, LT, and 
aatA genes, with the 16S gene used as DNA control. The system was validated with 66 fecal samples and 44 
wastewater samples. At least one positive isolate was detected by a virulent gene among the 20 that were 
screened. The analysis of fecal samples from children younger than 6 years of age detected frequencies of 
25% LT positive strains, 8.3% eae, 8.3% bfpA, 16.7% ipaH, as well as 12.5 % aatA and 5T. On the other 
hand, wastewater samples revealed frequencies of 25.7% eaeA positive, 30.3% stxl, 15.1% LT and 19.7% 
aatA. This study is an initial step toward carrying out epidemiological field research that will reveal the 
presence of these bacterial varieties. 
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INTRODUCTION 

Diarrhea continues to be a major cause of illness and 
death worldwide. According to the World Health Organization 
(WHO), 1.4 million pre-school children die annually due to 
diarrhea resulting from intestinal infection (19). 

The Pan-American Health Organization (PAHO) reports 
that the average rate of death from acute diarrhea in children 
younger than 5 years of age in Latin America and the 



Caribbean is 4.1%. The regions with the highest rates are 
Central America (7.8%), the Caribbean (5.5%), and 5.1% in the 
Andean region (23). Costa Rica, for example, recorded 450 
deaths due to diarrhea during 2001-2005, of which only 20 
were found to have an identified causal agent (18). 

Escherichia coli is a Gram negative bacillus, a normal 
microorganism in humans, but one which can produce 
symptoms of diarrhea when virulent factors are acquired, 
among which are enterotoxins, adhesines, and colonization 
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factors (33). 

Diarrheogenic varieties of E. coli exhibit a wide range of 
clinical symptoms that include traveler's diarrhea and hemorrhagic 
diarrhea (24) as well as cases produced by zoonosis (14). To date, 
six varieties have been clearly described and characterized by 
virulence factors capable of causing diarrhea in humans: 1) 
enterotoxigenic E. coli or ETEC, whose most distinctive genes are 
the stable thermotoxin -st- and thermolabile -It-; 2) 
enteropathogenic E. coli or EPEC, whose characteristic genes are 
the intimine -eaeA- and the bundle forming pilus -bfp-)\ 3) shiga- 
toxin E. coli or STEC, whose toxins are encoded in the stxl y stx2 
genes, 4) enteroinvasive E. coli or EIEC, one of whose 
characteristic virulence traits is the ipaH, which belongs to the 
invasion plasmid; 5) enteroaggregative E. coli or EAEC, with the 
pCVD432 plasmid, for which the aatA gene is one of the most 
stable regions; and 6) diffusely adherent E. coli or DAEC, whose 
virulence genes have yet to be fully profiled (1 1, 22). 

A number of different procedures have been described for 
identifying these pathogens since this cannot be carried out based 
on their phenotypical features. These strategies have ranged from 
cultivating cells to bio-molecular identification, such as Multiplex 
PCR (1, 3, 11, 20, 21). This means that routine detection of these 
varieties is costly. Moreover, the literature demonstrates that the 
frequency of virulent E. coli varieties is much lower than that of 
non-virulent varieties, thus making it necessary to screen various 
isolates per sample in order to detect them (10). Costa Rica and 
most other Latin American countries lack epidemiological data on 
the circulation of diarrheogenic E. coli as well as on its impact on 
public health. The main goal of the present study was to design a 
quick and easy screening system that would make analysis of the 
greatest number of E. coli isolates possible, as well as to determine 
the applicability of the system in both fecal samples from children 
less than 6 years of age and in wastewater samples taken from 
stabilization or settling ponds. 

METHODOLOGY 

Samples 

Sixty six fecal samples were collected through a community 
project at the University of Costa Rica (TCU-350) with children 



younger than 6 years of age in Montes de Oca county in San Jose 
province, Costa Rica, during February and March of 2007. These 
samples were inoculated in MacConkey agar (Oxoid®) within 12 
hours after collection, and were incubated at 35 °C for 24 hours. 
Moreover, 24 water samples were collected for over a year (from 
September of 2007 until December of 2008) from inlet and outfall 
points of the Costa Rican Water Supply and Sewer Institute's 
settling ponds located in Nicoya, Canas, Liberia, Santa Cruz, and 
Perez Zeledon. The wastewater samples were processed by the 
most probable number (MPN) technique for fecal coliforms as 
described by the American Public Health Association (2). After 
the presumptive phase, a mix was made of all tubes that tested 
positive by dilution. Each subsample (subsamples being 
understood as each dilution of an analyzed sample that had tested 
positive) was then inoculated into MacConkey agar (Oxoid®) and 
incubated at 35 °C for 24 hours. 

Gene pool system 

Twenty colonies from the bacterial cultures of the feces and 
wastewater samples obtained in MacConkey agar (Oxoid®) were 
inoculated in a single tube of soy trypticase broth (Oxoid®) and 
then incubated at 35 °C for 24 hours. In order to include atypical 
E. coli morphotypes, all the pools were prepared with 25% lactose 
negative colonies and 75% lactose positive colonies. To be able to 
assess the detection limit of the method, bacterial pools containing 
positive control colonies and strains with no virulence factors were 
utilized; the ratio used was 1:19 (1 positive colony was inoculated 
with a known virulence factor, along with 19 bacterial strains with 
not known virulence factors). Pools with 1+1:18 ratios, in which 
colonies that were carriers of different virulence factors were 
mixed with strains with not known virulence factors, were also 
evaluated. 

DNA extraction and quantification 

The extraction of the DNA was carried out in accordance 
with the Franck thermic protocol (7). The extracted DNA material 
was stored at -70 °C until being used. The quantification of the 
DNA from each pool was carried out with an Epependorff Plus® 
biophotometer, with a dilution factor of 1:200. 
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Polymerase Chain Reaction (PCR) 

Two Multiplex PCR and one simple PCR, which had been 
standardized in the laboratory, were used to process the gene 
pool samples. For the PCR Multiplex, the following reagent 
concentration was utilized: free buffer Mg IX Fermentas®, 
MgCl 2 1.75 mM Fermentas®, dNTP 200 uM Fermentas®, 0.6 
U of Taq polymerase Fermentas® per reaction, and 4 pi of 
bacterial DNA. The primers utilized for the PCR Multiplex 1 
were as follows: 5'GAC CCG GCA CAA GCA TAA GC 3' 
and 5'CCA CCT GCA GCA ACA AGA GG 3' for the eaeA 
gene (384 pb, 0.04 ug/reaction), 5 'A AT GGT GCT TGC GCT 
TGC TGC 3' and 5' GCC GCT TTA TCC AAC CTG GTA3' 
for the bfpA gene (324 pb, 0.03 ug/reaction), 5 'CTG GAT TTA 
ATG TCG CAT AGT G3' and 5 'AGA ACG CCC ACT GAG 
ATC ATC3' for the stxl gene (150 pb, 0.045 ug/reaction), 
5'GGC ACT GTC TGA AAC TGC TCC 3' and 5 'TCG CCA 
GTT ATC TGA CAT TCT G3' for the stxl gene (250 pb, 0.06 
ug/reaction) (13) and 5 ' GTTCCTTG ACCGCCTTTCCGATA 
CCGTC3' and 5 ' GCCGGTC AGCC ACCCTCTGAGAGTAC 
3' for the ipaH gene (600 pb, 0.05 ug/reaction) (4). The 
primers utilized for the Multiplex PCR 2 were as follows: 5' 
TCT GTA TTG TCT TTT TCA CC 3' and 5' TTA ATA GCA 
CCC GGT ACA AGC 3' for the st toxin (186 pb, 0.4 
ug/reaction) (8), 5' TCT CTA TGT GCA TAC GGA GC 3' 
and 5' CCA TAC TGA TTG CCG CAA T 3' for the It toxin 
(322 pb, 0.02 ug/reaction) (30) and 5 'CTG GCG AAA GAC 
TGT ATC AT 3' and 5 C AATGTATAGAAATCCGCTGTT 
3' for a segment of the pCVD423 aatA gene (630 pb, 0.045 
ug/reaction) (27). Finally, a simple PCR with MasterMix 
Fermentas® 2X was used for the 16S; as primers, 
5 ' GGGATG A A AGTT A ATACCTTTGCTC 3' and 5' 
TTCCCGAAGGCACATTCT 3' (584 pb) were used (31). 
The amplification was done in a DNA thermal cycler (Applied 
BioSystem®) using 5 min at 95 °C for initial denaturation 
followed by 35 cycles of 92°C (denaturation), 55 °C 
(annealing) and 72 °C (primer extension), for 45 seconds each, 
with a final 10 min extension at 72°C). For the simple PCR, the 
amplification conditions were a initial denaturation for 5 



minutes at 94 °C, followed by 35 cycles of PCR divided in 94 
°C (denaturation), 56 °C (annealing) and 72 °C (primer 
extension), for 30 seconds each, with a final extension of 2 
minutes at 72 °C. The detection limit for all the PCR Multiplex 
and simple PCR reactions was assessed by utilizing quantified 
bacterial DNA dilutions (between 10 and 60 ng/ul) and the 
specificity of the PCR was assessed by using other members of 
the family Enterobacteriaceae . PCR products were visualized 
by 2% agarose gels (TopVision™ Fermentas®) with 0.05% 
ethydium bromide. The electrophoresis processes were carried 
out in a Fisher Scientific® electrophoresis chamber at 13.5 
V/cm for 60 minutes. DNA fragment sizes were compared to a 
molecular weight marker (Fermentas®), with a 50 base pair 
ladder. 

RESULTS 

PCR 

The optimum conditions of each PCR reaction were 
standardized in the laboratory and detailed in the methodology. 
The minimum concentration of DNA detected was of 20 ug/ml 
for the 16S, the stxl, stxl, and ipaH genes, and 30 ug/ml for 
the eaeA and bfpA genes. In the Multiplex 2 reaction, the 
minimum concentration of DNA detected was 40 ug/ml. For 
which extractions of DNA with concentrations of between 60 
and 200 ug/ml were used. In the specificity assays 
amplification was observed only for the 16S gene of Shigella 
sp. strain, whereas no amplification was observed in any of the 
other genes and strains. 

Gene pool system 

The detection limit of the gene pool method was evaluated 
in order to determine the capacity of detecting the virulence 
genes of pathogenic E.coli strains, which were present in the 
pool in a much lower concentration as compared to the 
predominant non virulent E. coli strains (1+19). Before the 
PCR analysis, the DNA obtained for each gene pool was 
quantified, and positive detections were obtain from the DNA 
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pools in which only one strain was the carrier of virulence 
factors (1:19); this was done in quintuplicate for each gene 
under study (Figure 1). To evaluate the simultaneous detection 
of the genes, mixed pools were prepared from the positive 



control strains, an effective detection was achieved for each 
one of the specific genes. This was carried out in quintuplicate 
for the eaeA+/bfpA+, stxl, stx2, ipaH combinations, and for 
the ST+/LT+, aatA combination. 





123 4567 8 9 123456789 10 11 12 

Figure 1. A. Multiplex PCR for bacterial positive pool of genes ST, LT y aatA in the ratio 19:1. 1. Molecular weight marker, 
2.DNA pool eae+/bfpA+, 3. DNA pool ST+/LT+, 4. DNA pool aatA+, 5. DNA pool eae+/stxl+, 6. DNA pool eae+/stx2+, 1. 
DNA pool ipaH+, 8.DNA pool 5T?+/Lr+/aatA+, 9. Reagent control. B. Multiplex PCR for bacterial positive pool of genes eae, 
bfpA, stxl, stx2 e ipaH in the ratio 19:1. 1. Molecular weight marker, 2.DNA pool eae+/bfpA+, 3. DNA pool ST+/LT+, 4. DNA 
pool aatA+, 5. DNA pool eae+/stxl+, 6. DNA pool eae+/stx2+, 7. DNA pool ipaH+, 8. DNA pool eae+l bfpA+lstxl+lstx2+/ 
ipaH+, 9. DNA pool ST+/LT+/aatA+, 10. DNA E. coli ATCC 25922, 11. DNA Enterobacter cloacae, 12.Reagent control 



Fecal samples 

Of the 66 samples analyzed, 1 was discarded since the 16S 
gene was not detected by PCR; samples showing negative 
results for the 765 gene were excluded since with the 
established conditions, their detection would be found to be 



below the sensitivity limit. Thirty-seven percent of the samples 
yielded negative results, 34% of the samples tested positive for 
more than one virulence factor, and 24% of the samples tested 
positive for just one virulence factor. The specific results are 
detailed in Table 1 . 



Table 1. Detection frequency (%) in fecal and wastewater samples (input and output of ponds) analyzed virulence genes of 



Escherichia coli. 



Analyzed gene 


Fecal sample 
frequency (%) 
n=66 


Input ponds sample 
frequency (%) 
n=13 


Output ponds sample 
frequency (%) 
n=ll 


General ponds 
frequency (%) 
n=24 


St 


21,2 


7,7 




12,5 


It 


15,1 


23,1 


27,3 


25 


eaeA 


25,7 


23,1 


27,3 


21 


bfpA 




7,7 


9,1 


8,3 


ipaH 




15,4 


18,2 


16,7 


stxl 


30,3 








stxl 










aatA 


19,7 


7,7 


27,3 


12,5 
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Wastewater samples 

Of the 24 samples taken from settling ponds, which were 
analyzed as 77 subsamples, no samples were discarded since in 
every case at least one subsample showed amplification of the 
16S gene. Thirteen samples were taken from pond inlets, and 
1 1 were taken from their out fall points. Of the total of samples 
analyzed, 46% were negative for all virulence factors under 
study, 33.3% were positive for at least one virulence factor, 
and 12.5% were positive for just one virulence factor. The 
relative frequencies found are detailed in Table 1 . 

DISCUSSION 

The gene pool system is an innovative element in this 
study. It has been widely documented that detection of 
diarrheagenic varieties of E.coli is methodologically complex 
because this bacterial group is found in low concentrations in 
fecal samples, and then as a result, analysis from 1 to 5 isolates 
can yield low detection frequencies (10, 16). We propose that 
this newly described tool can be used to overcome this 
limitation. 

To work adequately, the gene pool system requires 
adjustments and standardization of the quantity of genetic 
material obtained in order to generate reproducible results (28). 
In the case of this PCR, to standardize the optimum conditions 
for amplification and to prevent false negatives because of low 
DNA concentration, assays were carried out to determine the 
minimum concentration of DNA necessary for detection of the 
expected products (between 60 and 200 pg/ml); thus, the 
results obtained are reproducible. 

Moreover, the implemented detection system was assayed 
in two distinct matrixes in order to evaluate its versatility. 
Fecal samples were chosen due to the fact that in Latin 
America it is common to find these bacterial groups among, 
both symptomatic and asymptomatic populations (26). On the 
other hand, settling ponds were selected because they are 
ecosystems with a great variety of microorganisms where it is 
likely to find potentially virulent strains (32). 



The feasibility of detecting diarrheagenic E. coli in the 
analysis of samples from wastewater in settling ponds by 
enhancing the most probable number (MPN) approach is 
evident. This is what makes this procedure much more 
accessible than current alternatives, such as concentration 
techniques by "in situ" filters, or filtering through membranes 
(2). 

An important thing to note is that all the positive dilutions 
have to be analyzed by the enhanced approach since, the 
presence of microorganisms does not follow a statistically 
normal distribution (2). This fact was corroborated in this study 
upon finding the presence of microorganisms in different 
dilutions of a given sample, regardless of the composition of 
dilutions themselves. 

The settling ponds studied have retention periods of 
approximately three months, and their effectiveness, from the 
microbiological standpoint, is measured by the capacity to 
diminish the microbial load in at least three logarithms (25). 
Nevertheless, the survival of pathogenic microorganisms has 
not been studied in detail. In addition to the detection of the 
genes under analysis, differences were found among the 
profiles of genes across the samples taken from the pond inlet 
and outfall points (Table 1). 

This phenomenon can be explained from the theoretical 
point of view given that the genome of diarrheagenic varieties 
of E. coli is common to varieties found in the wild, which 
allows both groups of bacteria to survive in the greater 
environment (9). Moreover, there is evidence that some 
varieties, such as the Shiga toxin producing E. coli, can 
withstand adverse conditions such as changes in temperature or 
pH level, and even desiccation. Such evidence suggests that 
settling ponds are ecosystems with the ability to promote 
survival of potentially pathogenic bacteria even though the 
ponds are efficient in reducing the number of microorganisms 
that end up exiting the system. This survival may mean that the 
ponds recirculate the major virulent varieties of E. coli and 
release them into the environment at different moments, which 
could explain the variation in the profiles of virulent genes 
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found across the ponds inlet and outfall points. 

On the other hand, the system proved capable of detecting 
diarrheagenic E. coli in fecal samples as well. It is important to 
emphasize that although that the samples were taken from 
asymptomatic children, the frequencies detected were greater 
than those usually described, in which only 1 to 3 isolations per 
sample are analyzed (6). Among the hypotheses to explain this 
finding are the following: improved detection thanks to the use 
of gene pools, a high prevalence of pathotypes in the 
population, and the fact that these pathogenic varieties are 
regulated by the surroundings (17, 9). However, future studies 
are needed in order to analyze this phenomenon in detail. 

In the fecal and wastewater samples analyzed, genes were 
simultaneously found that are characteristic of both ETEC and 
EPEC; this is a trend that has been described for a number of 
years, and which at the clinical level produces more severe 
symptoms than those generated by either of these varieties 
individually (5). These findings can serve as a guide for better 
treatment in the health sector for patients with acute diarrhea 
that present these co-infections. Nonetheless, studies of this 
nature are scarce, as mentioned previously, due to routine 
analysis being carried out by means of profiling 1 to 3 
isolations on an individual basis, which could overlook 
identification of other pathogenic varieties present in a given 
sample. 

Another important finding is the high frequency at which 
the eaeA gene was detected, both in wastewater and fecal 
samples; this could indicate the presence of atypical ECEP 
varieties in the country, which is congruent with findings 
worldwide, where the atypical EPEC varieties are more 
frequent than typical ones (1). 

Finally, although methodologies currently exist to analyze 
microorganisms directly in the sample which improves 
response time, with this system, the cultivation of 
microorganisms ensures their viability and avoids the problems 
created by substance inhibitors in the sample for detection by 
PCR. (12, 29) 

To summarize, the present study demonstrates the 



efficiency of implementing a gene pool system for detecting 
diarrheagenic E. coli. This tool is useful for analysis of 
wastewater and fecal samples, and its implementation in the 
field is an initial step toward carrying out epidemiological 
studies to assess the incidence, prevalence, and persistence of 
certain diarrheagenic E. coli varieties. This information can 
make it possible to orient authorities as to the proper means of 
environmental control when handling wastewater, as well as to 
guide those in the clinical field as to the best course of 
treatment for patients infected with one or more pathogenic 
varieties of E. coli. 
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